Smart Paper Notes

Development of a face mask detection pipeline for mask-wearing monitoring in the era of the COVID-19 pandemic: A modular approach

Benjaphan Sommana , Ukrit Watchareeruetai , Ankush Ganguly , Samuel W. F. Earp , Taya Kitiyakara , Suparee Boonmanunt , Ratchainant Thammasudjarit

Categories: Computer Vision | Machine Learning

2021-12-30

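During the SARS-CoV-2 pandemic, mask-wearing became an effective tool for preventing the spread and contraction of the virus. The ability to monitor the mask-wearing rate in the population would be useful for determining public health strategies against the virus. However, artificial intelligence technologies for detecting face masks have not yet been deployed at a large scale in real life to measure the mask-wearing rate in public. In this paper, we present a two-step face mask detection approach consisting of two separate modules: 1) face detection and alignment, and 2) face mask classification. This approach allowed us to experiment with different combinations of face detection and face mask classification modules. More specifically, we experimented with PyramidKey and RetinaFace as face detectors while keeping a lightweight backbone for the face mask classification module. Moreover, we also provide a relabeled annotation of the test set of the AIZOO dataset, in which we corrected the wrong labels of some face images. The evaluation results on the AIZOO and Moxa 3K datasets show that the proposed face mask detection pipeline surpasses the state-of-the-art methods. The proposed pipeline also yields a higher mAP on the relabeled test set of the AIZOO dataset than on the original test set. Since we trained the proposed model on in-the-wild face images, we can successfully deploy our model to monitor the mask-wearing rate using public CCTV images.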

translated by Google Translate
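The modular design described in the abstract above (a face detector feeding a separate mask classifier) can be sketched as follows. This is a minimal illustration, not the authors' implementation: `detect_masks`, `toy_detector`, and `toy_classifier` are hypothetical names, the alignment step is omitted, and the toy components stand in for real models such as a RetinaFace detector and a lightweight classifier.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# A box is (x0, y0, x1, y1) in pixel coordinates; an image is a list of pixel rows.
Box = Tuple[int, int, int, int]
Image = Sequence[Sequence[float]]


@dataclass
class MaskDetection:
    box: Box
    label: str


def crop(image: Image, box: Box) -> List[List[float]]:
    """Cut the face region out of the image (alignment is omitted in this sketch)."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def detect_masks(
    image: Image,
    face_detector: Callable[[Image], List[Box]],
    mask_classifier: Callable[[List[List[float]]], str],
) -> List[MaskDetection]:
    """Step 1: find faces. Step 2: classify each cropped face independently."""
    return [
        MaskDetection(box, mask_classifier(crop(image, box)))
        for box in face_detector(image)
    ]


# Hypothetical stand-ins so the sketch runs without any trained model.
def toy_detector(image: Image) -> List[Box]:
    return [(0, 0, 2, 2), (2, 0, 4, 2)]


def toy_classifier(face: List[List[float]]) -> str:
    mean = sum(map(sum, face)) / (len(face) * len(face[0]))
    return "mask" if mean > 0.5 else "no_mask"


image = [[1, 1, 0, 0], [1, 1, 0, 0]]
results = detect_masks(image, toy_detector, toy_classifier)
mask_rate = sum(r.label == "mask" for r in results) / len(results)
```

Because the two stages only share the `Box` and cropped-face interfaces, either component can be swapped without touching the other, which is the property the abstract relies on when comparing detector and classifier combinations.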

LOTR: Face Landmark Localization Using Localization Transformer

Ukrit Watchareeruetai , Benjaphan Sommana , Sanjana Jain , Pavit Noinongyao , Ankush Ganguly , Aubin Samacoits , Samuel W. F. Earp , Nakarin Sritrakool

Categories: Computer Vision | Artificial Intelligence | Machine Learning

2021-09-21

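This paper proposes a novel Transformer-based facial landmark localization network named Localization Transformer (LOTR). The proposed framework is a direct coordinate regression approach that leverages a Transformer network to better utilize the spatial information in the feature map. An LOTR model consists of three main modules: 1) a visual backbone that converts an input image into a feature map, 2) a Transformer module that improves the feature representation from the visual backbone, and 3) a landmark prediction head that directly predicts the landmark coordinates from the Transformer's representation. Given cropped-and-aligned face images, the proposed LOTR can be trained end-to-end without requiring any post-processing steps. This paper also introduces the smooth-Wing loss function, which addresses the gradient discontinuity of the Wing loss, leading to better convergence than standard loss functions such as L1, L2, and Wing loss. Experimental results on the JD landmark dataset provided by the First Grand Challenge of 106-Point Facial Landmark Localization indicate the superiority of LOTR over the existing methods on the leaderboard and over recent heatmap-based approaches. On the WFLW dataset, the proposed LOTR framework demonstrates promising results compared with several state-of-the-art methods. Additionally, we report an improvement in state-of-the-art face recognition performance when using our proposed LOTRs for face alignment.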
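The gradient discontinuity mentioned in the abstract can be seen in the standard Wing loss itself. The sketch below implements the original Wing loss (not the paper's smooth-Wing variant, whose formula the abstract does not give) and estimates its one-sided slopes at zero. The parameter values `w = 10` and `eps = 2` are illustrative assumptions.

```python
import math


def wing_loss(x: float, w: float = 10.0, eps: float = 2.0) -> float:
    """Standard Wing loss for a single coordinate error x."""
    abs_x = abs(x)
    if abs_x < w:
        # Logarithmic region: amplifies small and medium errors.
        return w * math.log(1.0 + abs_x / eps)
    # Linear region, shifted by c so the two pieces meet at |x| = w.
    c = w - w * math.log(1.0 + w / eps)
    return abs_x - c


# One-sided finite-difference slopes at x = 0.
h = 1e-6
slope_right = (wing_loss(h) - wing_loss(0.0)) / h
slope_left = (wing_loss(0.0) - wing_loss(-h)) / h
```

The loss value is continuous everywhere, but the slope jumps from about `-w/eps` to about `+w/eps` across zero, so the gradient does not vanish as the error shrinks. This is the kind of behavior a smoothed variant is meant to remove.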


An Introduction to Variational Inference

Ankush Ganguly , Samuel W. F. Earp

Categories: Machine Learning | Artificial Intelligence | Machine Learning (Statistics)

2021-08-30

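Approximating complex probability densities is a core problem in modern statistics. In this paper, we introduce the concept of Variational Inference (VI), a popular method in machine learning that uses optimization techniques to estimate complex probability densities. This property allows VI to converge faster than classical methods, such as Markov Chain Monte Carlo sampling. Conceptually, VI works by choosing a family of probability density functions and then finding the member closest to the actual probability density, often using the Kullback-Leibler (KL) divergence as the optimization metric. We introduce the Evidence Lower Bound to tractably compute the approximated probability density, and we review the ideas behind mean-field variational inference. Finally, we discuss the applications of VI to variational auto-encoders (VAE) and the VAE-Generative Adversarial Network (VAE-GAN). With this paper, we aim to explain the concept of VI and to assist future research that uses this approach.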
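The relationship between the KL divergence and the Evidence Lower Bound mentioned in the abstract can be written out as below. The notation is an assumption made for illustration: $x$ denotes the observed data, $z$ the latent variables, and $q$ a member of the chosen variational family.

```latex
\begin{align}
\log p(x) &= \mathrm{ELBO}(q) + \mathrm{KL}\bigl(q(z)\,\|\,p(z \mid x)\bigr), \\
\mathrm{ELBO}(q) &= \mathbb{E}_{q(z)}\bigl[\log p(x, z)\bigr] - \mathbb{E}_{q(z)}\bigl[\log q(z)\bigr], \\
q(z) &= \prod_{j} q_j(z_j) \quad \text{(mean-field family)}.
\end{align}
```

Since $\log p(x)$ does not depend on $q$ and the KL term is non-negative, maximizing the ELBO over $q$ is equivalent to minimizing the KL divergence to the posterior, without ever evaluating the intractable $p(z \mid x)$.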


AI applications in forest monitoring need remote sensing benchmark datasets

Emily R. Lines , Matt Allen , Carlos Cabo , Kim Calders , Amandine Debus , Stuart W. D. Grieve , Milto Miltiadou , Adam Noach , Harry J. F. Owen , Stefano Puliti

Categories: Artificial Intelligence

2022-12-20

With the rise in high resolution remote sensing technologies there has been an explosion in the amount of data available for forest monitoring, and an accompanying growth in artificial intelligence applications to automatically derive forest properties of interest from these datasets. Many studies use their own data at small spatio-temporal scales, and demonstrate an application of an existing or adapted data science method for a particular task. This approach often involves intensive and time-consuming data collection and processing, but generates results restricted to specific ecosystems and sensor types. There is a lack of widespread acknowledgement of how the types and structures of data used affects performance and accuracy of analysis algorithms. To accelerate progress in the field more efficiently, benchmarking datasets upon which methods can be tested and compared are sorely needed. Here, we discuss how lack of standardisation impacts confidence in estimation of key forest properties, and how considerations of data collection need to be accounted for in assessing method performance. We present pragmatic requirements and considerations for the creation of rigorous, useful benchmarking datasets for forest monitoring applications, and discuss how tools from modern data science can improve use of existing data. We list a set of example large-scale datasets that could contribute to benchmarking, and present a vision for how community-driven, representative benchmarking initiatives could benefit the field.


Relative Sparsity for Medical Decision Problems

Samuel J. Weisenthal , Sally W. Thurston , Ashkan Ertefaie

Categories: Machine Learning

2022-11-29

Existing statistical methods can be used to estimate a policy, or a mapping from covariates to decisions, which can then instruct decision makers. There is great interest in using such data-driven policies in healthcare. In healthcare, however, it is often important to explain to the healthcare provider, and to the patient, how a new policy differs from the current standard of care. This end is facilitated if one can pinpoint the aspects (i.e., parameters) of the policy that change most when moving from the standard of care to the new, suggested policy. To this end, we adapt ideas from Trust Region Policy Optimization. In our work, however, unlike in Trust Region Policy Optimization, the difference between the suggested policy and standard of care is required to be sparse, aiding with interpretability. In particular, we trade off between maximizing expected reward and minimizing the $L_1$ norm divergence between the parameters of the two policies. This yields "relative sparsity," where, as a function of a tuning parameter, $\lambda$, we can approximately control the number of parameters in our suggested policy that differ from their counterparts in the standard of care. We develop our methodology for the observational data setting. We propose a problem-specific criterion for selecting $\lambda$, perform simulations, and illustrate our method with a real, observational healthcare dataset, deriving a policy that is easy to explain in the context of the current standard of care. Our work promotes the adoption of data-driven decision aids, which have great potential to improve health outcomes.
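The trade-off described in the abstract can be sketched as a penalized objective. The symbols are assumptions for illustration and may differ from the paper's notation: $\theta$ parameterizes the suggested policy, $\theta_0$ the standard of care, and $V(\theta)$ stands for the expected reward (or an estimate of it from observational data).

```latex
\begin{equation}
\hat{\theta}_{\lambda} \;=\; \arg\max_{\theta}\; V(\theta) \;-\; \lambda \,\lVert \theta - \theta_0 \rVert_{1},
\qquad \lambda \ge 0 .
\end{equation}
```

With $\lambda = 0$ the policy is chosen for reward alone. As $\lambda$ grows, the $L_1$ penalty tends to set more coordinates of $\theta - \theta_0$ exactly to zero, so fewer parameters differ from the standard of care.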


Learning Bilinear Models of Actuated Koopman Generators from Partially-Observed Trajectories

Samuel E. Otto , Sebastian Peitz , Clarence W. Rowley

Categories: Machine Learning

2022-09-20

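Data-driven models for nonlinear dynamical systems based on approximating the underlying Koopman operator or generator have proven to be successful tools for forecasting, feature learning, state estimation, and control. It is well known that the Koopman generators for control-affine systems also have an affine dependence on the input, leading to convenient finite-dimensional bilinear approximations of the dynamics. Yet two main obstacles still limit the scope of current approaches for approximating the Koopman generators of systems with actuation. First, the performance of existing methods depends heavily on the choice of basis functions over which the Koopman generator is to be approximated, and there is currently no universal way to choose them for systems that are not measure preserving. Second, if we do not observe the full state, we may not have access to a sufficiently rich collection of such functions to describe the dynamics. This is because the commonly used method of forming time-delayed observables fails when there is actuation. To remedy these issues, we write the dynamics of observables governed by the Koopman generator as a bilinear hidden Markov model and determine the model parameters using the expectation-maximization (EM) algorithm. The E-step involves a standard Kalman filter and smoother, while the M-step resembles control-affine dynamic mode decomposition for the generator. We demonstrate the performance of this method on three examples: recovery of a finite-dimensional Koopman-invariant subspace for an actuated system with a slow manifold; estimation of Koopman eigenfunctions for the unforced Duffing equation; and model-predictive control of a fluidic pinball system based only on noisy observations of lift and drag.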
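The step from a control-affine system to a bilinear model, as stated in the abstract, can be sketched as follows. The symbols are assumptions for illustration: $x$ is the state, $u_i$ are the inputs, $\mathcal{L}_i$ are Koopman generators of the individual vector fields, and $z$ collects a finite set of observables.

```latex
\begin{align}
\dot{x} &= f_0(x) + \sum_{i=1}^{m} u_i\, f_i(x)
  && \text{(control-affine dynamics)} \\
\mathcal{L}_u &= \mathcal{L}_0 + \sum_{i=1}^{m} u_i\, \mathcal{L}_i
  && \text{(generator is affine in the input)} \\
\dot{z} &\approx A z + \sum_{i=1}^{m} u_i\, B_i z
  && \text{(finite-dimensional bilinear approximation)}
\end{align}
```

Here $A$ and $B_i$ are matrix approximations of $\mathcal{L}_0$ and $\mathcal{L}_i$ on the span of the chosen observables. The model is linear in $z$ for fixed $u$ and linear in $u$ for fixed $z$, which is what makes it bilinear.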


Data-Driven Blind Synchronization and Interference Rejection for Digital Communication Signals

Alejandro Lancho , Amir Weiss , Gary C. F. Lee , Jennifer Tang , Yuheng Bu , Yury Polyanskiy , Gregory W. Wornell

Categories: Artificial Intelligence | Machine Learning

2022-09-11

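We study the potential of data-driven deep learning methods for separating two communication signals from an observation of their mixture. In particular, we assume knowledge of the generation process of one of the signals, dubbed the signal of interest (SOI), and no knowledge of the generation process of the second signal, referred to as interference. This form of the single-channel source separation problem is also referred to as interference rejection. We show that capturing high-resolution temporal structures (nonstationarities), which enables accurate synchronization to both the SOI and the interference, leads to substantial performance gains. With this key insight, we propose a domain-informed neural network (NN) design that is able to improve upon both "off-the-shelf" NNs and classical detection and interference rejection methods, as demonstrated in our simulations. Our findings highlight the key role that communication-specific domain knowledge plays in the development of data-driven approaches that hold the promise of unprecedented gains.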


Graph Neural Networks for Low-Energy Event Classification & Reconstruction in IceCube

R. Abbasi , M. Ackermann , J. Adams , N. Aggarwal , J. A. Aguilar , M. Ahlers , M. Ahrens , J. M. Alameddine , A. A. Alves Jr. , N. M. Amin

Categories: Machine Learning

2022-09-07

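IceCube, a cubic-kilometer array of optical sensors built to detect atmospheric and astrophysical neutrinos between 1 GeV and 1 PeV, is deployed 1.45 km to 2.45 km below the surface of the ice sheet at the South Pole. The classification and reconstruction of events from the in-ice detectors play a central role in the analysis of IceCube data. Reconstructing and classifying events is a challenge because of the detector geometry, the inhomogeneous scattering and absorption of light in the ice and, below 100 GeV, the relatively low number of signal photons produced per event. To address this challenge, IceCube events can be represented as point cloud graphs, with a Graph Neural Network (GNN) used as the classification and reconstruction method. The GNN is capable of distinguishing neutrino events from cosmic-ray backgrounds, classifying different neutrino event types, and reconstructing the deposited energy, direction, and interaction vertex. Based on simulation, we provide a comparison in the 1-100 GeV energy range with the state-of-the-art maximum likelihood techniques used in current IceCube analyses, including the effects of known systematic uncertainties. For neutrino event classification, the GNN increases the signal efficiency by 18% at a fixed false positive rate (FPR), compared to current IceCube methods. Alternatively, the GNN reduces the FPR by more than a factor of 8 (to below half a percent) at a fixed signal efficiency. For the reconstruction of energy, direction, and interaction vertex, the resolution improves by an average of 13%-20% compared to current maximum likelihood techniques. When run on a GPU, the GNN is capable of processing IceCube events at a rate nearly double the median IceCube trigger rate of 2.7 kHz, which opens the possibility of using low-energy neutrinos in online searches for transient events.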
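The abstract compares classifiers by their signal efficiency at a fixed false positive rate. A minimal sketch of how such a figure can be computed from classifier scores is given below. The function name and the toy scores are made up for illustration and have no connection to IceCube data or software.

```python
import math


def signal_efficiency_at_fpr(signal_scores, background_scores, target_fpr):
    """Pick the loosest cut 'score > threshold' whose FPR does not exceed target_fpr.

    Returns (threshold, signal efficiency, achieved FPR).
    """
    ranked_bg = sorted(background_scores, reverse=True)
    # Number of background events that may pass the cut (small tolerance for float error).
    allowed = math.floor(target_fpr * len(ranked_bg) + 1e-9)
    # Events strictly above the (allowed + 1)-th highest background score pass.
    threshold = ranked_bg[allowed] if allowed < len(ranked_bg) else float("-inf")
    efficiency = sum(s > threshold for s in signal_scores) / len(signal_scores)
    fpr = sum(b > threshold for b in ranked_bg) / len(ranked_bg)
    return threshold, efficiency, fpr


# Toy classifier outputs (higher means more signal-like).
background = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95]
signal = [0.85, 0.9, 0.7, 0.99, 0.3]

threshold, efficiency, fpr = signal_efficiency_at_fpr(signal, background, target_fpr=0.2)
```

Holding the FPR fixed and comparing efficiencies, or holding the efficiency fixed and comparing FPRs, are two readings of the same ROC curve, which is why the abstract can quote both an efficiency gain and an FPR reduction.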


Deep filter bank regression for super-resolution of anisotropic MR brain images

Samuel W. Remedios , Shuo Han , Yuan Xue , Aaron Carass , Trac D. Tran , Dzung L. Pham , Jerry L. Prince

Categories: Computer Vision

2022-09-06

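In 2D multi-slice magnetic resonance (MR) acquisition, the through-plane signals are typically of lower resolution than the in-plane signals. While contemporary super-resolution (SR) methods aim to recover the underlying high-resolution volume, the estimated high-frequency information is implicit in end-to-end data-driven training rather than being explicitly stated and sought. To address this, we reframe the SR problem statement in terms of perfect reconstruction filter banks, enabling us to identify and directly estimate the missing information. In this work, we propose a two-stage approach to approximate the completion of a perfect reconstruction filter bank corresponding to the anisotropic acquisition of a particular scan. In stage 1, we estimate the missing filters using gradient descent, and in stage 2, we use a deep network to learn the mapping from coarse coefficients to detail coefficients. In addition, the proposed formulation does not rely on external training data, circumventing the need for domain shift correction. Under our approach, SR performance is improved particularly in "slice gap" scenarios, likely because of the constrained solution space imposed by the framework.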
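The idea of a perfect reconstruction filter bank, in which a signal is split into coarse and detail coefficients that together recover it exactly, can be illustrated with the two-channel Haar bank. This is a textbook stand-in chosen for brevity. The paper estimates scan-specific filters by gradient descent and predicts the detail coefficients with a deep network, neither of which is reproduced here.

```python
import math


def haar_analysis(x):
    """Split an even-length signal into coarse (low-pass) and detail (high-pass) halves."""
    s = math.sqrt(2.0)
    coarse = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return coarse, detail


def haar_synthesis(coarse, detail):
    """Invert haar_analysis: interleave the samples recovered from each coefficient pair."""
    s = math.sqrt(2.0)
    x = []
    for a, d in zip(coarse, detail):
        x.extend([(a + d) / s, (a - d) / s])
    return x


signal = [4.0, 2.0, 5.0, 5.0, 1.0, 7.0]
coarse, detail = haar_analysis(signal)

# With both channels the signal is recovered exactly (up to float rounding).
reconstructed = haar_synthesis(coarse, detail)

# With the detail channel missing, only pairwise averages survive.
blurred = haar_synthesis(coarse, [0.0] * len(detail))
```

In this picture, a low-resolution acquisition plays the role of the coarse channel alone, and super-resolution amounts to estimating the missing `detail` coefficients before synthesis.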


Fluorescence molecular optomic signatures improve identification of tumors in head and neck specimens

Yao Chen , Samuel S. Streeter , Brady Hunt , Hira S. Sardar , Jason R. Gunn , Laura J. Tafe , Joseph A. Paydarfar , Brian W. Pogue , Keith D. Paulsen , Kimberley S. Samkoe

Categories: Machine Learning | Computer Vision

2022-08-29

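In this study, a radiomics approach was extended to optical fluorescence molecular imaging data for tissue classification, termed "optomics". Fluorescence molecular imaging is emerging as a tool for precise surgical guidance during head and neck squamous cell carcinoma (HNSCC) resection. However, the tumor-to-normal tissue contrast is confounded by intrinsic physiological limitations arising from heterogeneous expression of the target molecule, epidermal growth factor receptor (EGFR). Optomics seeks to improve tumor identification by probing textural pattern differences in EGFR expression conveyed by fluorescence. A total of 1,472 standardized optomic features were extracted from fluorescence image samples. A supervised machine learning pipeline involving a support vector machine classifier was trained with the 25 top-ranked features selected by the minimum redundancy maximum relevance criterion. Model predictive performance was compared with a fluorescence intensity thresholding method by classifying test set image patches of resected tissue with histologically confirmed malignancy status. Compared with fluorescence intensity thresholding, the optomics approach provided a consistent improvement in prediction accuracy on all test set samples, irrespective of dose (mean accuracies of 89% vs. 81%; P = 0.0072). The improved performance demonstrates that extending the radiomics approach to fluorescence molecular imaging data offers a promising image analysis technique for cancer detection in fluorescence-guided surgery.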
